Abstract

We consider a linear regression model. We attempt to measure the influence on the residual for the full model and the reduced model. A criterion to select important independent regression variables is also derived using the influence diagnostics. This criterion turns out to be the same as the one proposed by Gupta and Huang (1988).

1. Introduction

We consider the following linear model

$$Y = X\beta + \varepsilon, \tag{1}$$

where $\varepsilon \sim N(0, \sigma_0^2 I_n)$, $I_n$ denotes the identity matrix of order $n$, $Y$ is an $n \times 1$ vector of responses, $X$ is an $n \times p$ ($n > p$) matrix of known constants of rank $p$, and $\beta$ is a $p \times 1$ parameter vector. Several authors have studied the influence on the fitted regression line when data are deleted. Let $\hat\beta$ be the usual least squares estimator of $\beta$ based on the full data and let $\hat\beta_A$ be an alternative least squares estimator based on a subset of the data. The empirical influence function $IF_A$ for $\hat\beta$ is defined to be

$$IF_A = \hat\beta - \hat\beta_A. \tag{2}$$

For a given positive definite matrix $M$ and a nonzero scale factor $c$, Cook and Weisberg (1980) defined the distance $D_A(M, c)$ between $\hat\beta$ and $\hat\beta_A$ as follows:

$$D_A(M, c) = \frac{(IF_A)' M (IF_A)}{c}. \tag{3}$$

Cook and Weisberg (1980) suggest that the matrix $M$ can be chosen to reflect specific interests.

Cook and Weisberg (1980) pointed out that in some applications, measurement of the influence of cases on the fitted values, $\hat{Y} = X\hat\beta$, may be more appropriate than measuring influence on $\hat\beta$. They gave an example in which prediction is the primary goal, so that it may be convenient to work with a reparameterized model whose regression coefficients are not of interest. To measure the influence on the fitted values $X\hat\beta$, they used the empirical influence function for $\hat{Y}$, defined as $X(IF_A)$. In this paper, we attempt to measure the influence on residuals or on $X\hat\beta$. A large influence on the residual should imply a large influence on $\hat\beta$, though the converse may not hold.

* This research was supported in part by the Office of Naval Research Contract N00014-88-K-0170 and NSF Grants DMS-8606964, DMS-8702620 at Purdue University.

Furthermore, Welsch (1982) pointed out that in an earlier paper Cook (1977) chose to measure the influence of the $i$-th case, with $\hat\beta_{(i)}$ denoting the least squares estimator computed without that case, by

$$D = \frac{(\hat\beta - \hat\beta_{(i)})' X'X (\hat\beta - \hat\beta_{(i)})}{s^2 p}, \tag{4}$$

where $s^2$ is the residual mean square for the full data. Welsch (1982) gave an example to show that when all of the observations but one lie on a line, (4) can give potentially confusing information, since it may indicate that some observations on the line are more influential than the one observation not on the line. This is counterintuitive, since the deletion of this one observation leads to a perfect fit. Welsch (1982) gave the example as follows:

Let $x_1 = x_4 = 10$, $x_2 = 11$, $x_3 = -11$; $y_1 = 10$, $y_2 = 11$, $y_3 = -11$, $y_4 = 10.5$, and fit a simple linear regression model with intercept. The fitted simple regression line is

$$\hat{Y} = 0.0885 + 1.0073X,$$

with $R^2 = 0.9995$, Root MSE $= 0.2909$, C.V. $= 100(\text{Root MSE})/\bar{Y} = 5.68\%$, and the values of Cook's $D$ are: (1) 0.109, (2) 0.144, (3) 178.495, (4) 0.477. By this measure, observation 3 appears much more influential than observation 4, even though it is the deletion of observation 4 that leads to a perfect fit. Therefore, finding a more reasonable measurement is very important. We shall consider the case of deleting one observation at a time. For the deletion of any subset of cases, the computations can be carried out similarly; we refer to Cook and Weisberg (1980) and Gray and Ling (1984).
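The figures quoted for Welsch's example can be checked with a few lines of Python. The sketch below is only an illustration: the function name `cooks_distance_simple` is hypothetical, and it uses the standard closed form of (4), $D_i = e_i^2 h_{ii} / \{p s^2 (1 - h_{ii})^2\}$, specialised to a straight-line fit ($p = 2$), rather than refitting the model with each case removed.

```python
def cooks_distance_simple(x, y):
    """Cook's D of (4) for a straight-line fit y = b0 + b1*x (so p = 2)."""
    n, p = len(x), 2
    xbar, ybar = sum(x) / n, sum(y) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)
    b1 = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / sxx
    b0 = ybar - b1 * xbar
    resid = [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]
    lev = [1 / n + (xi - xbar) ** 2 / sxx for xi in x]   # leverages h_ii
    s2 = sum(e * e for e in resid) / (n - p)             # residual mean square
    # Closed form of (4): D_i = e_i^2 h_ii / (p s^2 (1 - h_ii)^2)
    d = [e * e * h / (p * s2 * (1 - h) ** 2) for e, h in zip(resid, lev)]
    return b0, b1, d


# Welsch (1982) data as given in the text
x = [10, 11, -11, 10]
y = [10, 11, -11, 10.5]
b0, b1, welsch_d = cooks_distance_simple(x, y)
print(round(b0, 4), round(b1, 4), [round(v, 3) for v in welsch_d])
```

The very large value for observation 3 comes from its leverage being close to 1, not from a large residual, which is the behaviour the text describes as counterintuitive.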

Next we propose a selection criterion that combines the influence measure and variable selection. We derive a suitable choice of $M$ and $c$ in (3) to measure the influence and bias for the reduced model. Then the inferior reduced models can be determined. An example (Daniel and Wood (1980)) is studied to explain the idea of the proposed criterion.


2.  Influential  Observations  in  Linear  Regression  Model 


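Let

$$X = \begin{pmatrix} X_{(i)} \\ x_i' \end{pmatrix}, \qquad Y = \begin{pmatrix} Y_{(i)} \\ Y_i \end{pmatrix},$$

where $X$ is $n \times p$, $x_i'$ is the $i$-th row of $X$, and $X_{(i)}$ and $Y_{(i)}$ denote $X$ and $Y$ with the $i$-th case deleted. Let $e_i = Y_i - x_i'\hat\beta$, $i = 1, 2, \ldots, n$, $\hat\beta = (X'X)^{-1}X'Y$, and let $\hat\beta_{(i)}$ be the least squares estimator based on $X_{(i)}$ and $Y_{(i)}$. Writing $\mathbf{e}_{(i)} = Y_{(i)} - X_{(i)}\hat\beta$, we have

$$\begin{aligned}
\mathbf{e}_{(i)}'\mathbf{e}_{(i)} &= (Y_{(i)} - X_{(i)}\hat\beta)'(Y_{(i)} - X_{(i)}\hat\beta) \\
&= (Y_{(i)} - X_{(i)}\hat\beta_{(i)} + X_{(i)}\hat\beta_{(i)} - X_{(i)}\hat\beta)'(Y_{(i)} - X_{(i)}\hat\beta_{(i)} + X_{(i)}\hat\beta_{(i)} - X_{(i)}\hat\beta) \\
&= (Y_{(i)} - X_{(i)}\hat\beta_{(i)})'(Y_{(i)} - X_{(i)}\hat\beta_{(i)}) + (X_{(i)}\hat\beta_{(i)} - X_{(i)}\hat\beta)'(X_{(i)}\hat\beta_{(i)} - X_{(i)}\hat\beta),
\end{aligned}$$

since $(Y_{(i)} - X_{(i)}\hat\beta_{(i)})'X_{(i)} = Y_{(i)}'X_{(i)} - \hat\beta_{(i)}'X_{(i)}'X_{(i)} = 0$. Thus

$$\mathbf{e}_{(i)}'\mathbf{e}_{(i)} = SSE_{(i)} + R_{(i)},$$

where

$$SSE_{(i)} = (Y_{(i)} - X_{(i)}\hat\beta_{(i)})'(Y_{(i)} - X_{(i)}\hat\beta_{(i)})$$

is the residual sum of squares when the $i$-th case is deleted and $R_{(i)}$ denotes the second term of the decomposition. We have

$$\frac{\mathbf{e}_{(i)}'\mathbf{e}_{(i)}}{SSE_{(i)}} = 1 + \frac{R_{(i)}}{SSE_{(i)}}. \tag{5}$$

Define

$$D_{(i)} = \frac{R_{(i)}}{p\, s_{(i)}^2}, \quad \text{where } s_{(i)}^2 = \frac{SSE_{(i)}}{n - p - 1}. \tag{6}$$

If $D_{(i)}$ is large, then by (5) deleting the $i$-th data will heavily influence the fitted line. We rewrite $R_{(i)}$ as follows:

$$\begin{aligned}
R_{(i)} &= (\hat\beta_{(i)} - \hat\beta)'X_{(i)}'X_{(i)}(\hat\beta_{(i)} - \hat\beta) \\
&= [X_{(i)}(\hat\beta_{(i)} - \hat\beta)]'[X_{(i)}(\hat\beta_{(i)} - \hat\beta)] \\
&= (X_{(i)}\hat\beta_{(i)} - X_{(i)}\hat\beta)'(X_{(i)}\hat\beta_{(i)} - X_{(i)}\hat\beta).
\end{aligned} \tag{7}$$

Thus $R_{(i)}$ is the squared Euclidean distance between $X_{(i)}\beta$ and its ordinary least squares (OLS) estimate $X_{(i)}\hat\beta_{(i)}$, where the full data are used to estimate $\beta$ as the true value, giving the OLS estimate $\hat\beta$.

We define a statistic to measure the influence in (5) for the fitted line as follows:

$$D_{(i)} = \frac{(\hat\beta_{(i)} - \hat\beta)'X_{(i)}'X_{(i)}(\hat\beta_{(i)} - \hat\beta)}{p\, s_{(i)}^2} = \frac{R_{(i)}}{p\, s_{(i)}^2}. \tag{8}$$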

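Let the hat matrix

$$H = X(X'X)^{-1}X' = (h_{ij})_{n \times n},$$

where $h_{ij} = x_i'(X'X)^{-1}x_j$, $i, j = 1, 2, \ldots, n$. We have

$$(X_{(i)}'X_{(i)})^{-1} = (X'X - x_i x_i')^{-1} = (X'X)^{-1} + \frac{(X'X)^{-1}x_i x_i'(X'X)^{-1}}{1 - h_{ii}}, \qquad x_i'(X_{(i)}'X_{(i)})^{-1}x_i = \frac{h_{ii}}{1 - h_{ii}}, \tag{9}$$

$$\operatorname{Cov}(\hat{Y}) = X(X'X)^{-1}X'\sigma_0^2 = H\sigma_0^2,$$

$$\operatorname{Var}(\hat{Y}_i) = x_i'(X'X)^{-1}x_i\,\sigma_0^2 = h_{ii}\sigma_0^2, \quad i = 1, 2, \ldots, n,$$

and

$$\operatorname{Cov}(\hat\beta) = (X'X)^{-1}\sigma_0^2.$$

Thus

$$\hat\beta_{(i)} = (X_{(i)}'X_{(i)})^{-1}X_{(i)}'Y_{(i)} = (X'X - x_i x_i')^{-1}(X'Y - x_i Y_i) = \hat\beta - \frac{(X'X)^{-1}x_i e_i}{1 - h_{ii}}, \quad \text{where } e_i = Y_i - x_i'\hat\beta,$$

hence

$$\hat\beta_{(i)} - \hat\beta = -\frac{(X'X)^{-1}x_i e_i}{1 - h_{ii}}. \tag{10}$$

The $i$-th predicted residual is

$$e_{(i)} = Y_i - x_i'\hat\beta_{(i)}, \quad i = 1, 2, \ldots, n. \tag{11}$$

Then

$$e_{(i)} = Y_i - x_i'\hat\beta + \frac{x_i'(X'X)^{-1}x_i e_i}{1 - h_{ii}} = \frac{e_i}{1 - h_{ii}}$$

and

$$\operatorname{Var}(e_{(i)}) = \frac{\operatorname{Var}(e_i)}{(1 - h_{ii})^2} = \frac{\sigma_0^2}{1 - h_{ii}}.$$

Thus, from (7) and (10), we obtain the following result for the quadratic form $R_{(i)} = (\hat\beta_{(i)} - \hat\beta)'X_{(i)}'X_{(i)}(\hat\beta_{(i)} - \hat\beta)$:

$$R_{(i)} = \frac{e_i x_i'(X'X)^{-1}}{1 - h_{ii}}\, X_{(i)}'X_{(i)}\, \frac{(X'X)^{-1}x_i e_i}{1 - h_{ii}} = (1 - h_{ii})h_{ii}e_{(i)}^2,$$

$$D_{(i)} = \left[\frac{e_{(i)}}{s_{(i)}/(1 - h_{ii})^{1/2}}\right]^2 \cdot \frac{h_{ii}}{p}. \tag{14}$$

The first factor in (14) is the square of the studentized residual, that is, the residual divided by its standard error based on a fit to the data with the $i$-th case excluded, while the second factor is the leverage $h_{ii}$, which governs the variance of the $i$-th predicted value, divided by $p$.

Since $\hat\beta_{(i)}$ and $s_{(i)}^2$ are independent (cf. Graybill (1976)), and $Y_i$ and $s_{(i)}^2$ are also independent, $e_{(i)}$ and $s_{(i)}^2$ are independent. Thus

$$t = \frac{e_{(i)}}{s_{(i)}/(1 - h_{ii})^{1/2}} = \frac{e_i}{s_{(i)}(1 - h_{ii})^{1/2}}$$

has a $t$-distribution with $n - p - 1$ degrees of freedom. Hence $t^2$ is an $F$-distributed random variable with degrees of freedom 1 and $n - p - 1$.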
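The deletion identities above can be verified numerically. The following Python sketch, under the assumption of a straight-line model ($p = 2$) and using the Welsch (1982) data from the Introduction, refits the line with one case removed and compares the brute-force quantities with the closed forms $e_{(i)} = e_i/(1 - h_{ii})$, $(1 - h_{ii})h_{ii}e_{(i)}^2$ and $t^2 h_{ii}/p$. The helper names `fit_line` and `deletion_quantities` are illustrative only. Observation 4 is skipped because deleting it gives a perfect fit and hence $s_{(4)}^2 = 0$.

```python
import math


def fit_line(x, y):
    """Least squares intercept and slope for y = b0 + b1*x."""
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)
    b1 = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / sxx
    return ybar - b1 * xbar, b1


def deletion_quantities(x, y, i):
    """Pairs (brute force, closed form) for case i (0-based), with p = 2."""
    n, p = len(x), 2
    xbar = sum(x) / n
    sxx = sum((xj - xbar) ** 2 for xj in x)
    h = 1 / n + (x[i] - xbar) ** 2 / sxx             # leverage h_ii
    b0, b1 = fit_line(x, y)                          # full-data fit
    e = y[i] - (b0 + b1 * x[i])                      # ordinary residual e_i
    xd, yd = x[:i] + x[i + 1:], y[:i] + y[i + 1:]    # data without case i
    c0, c1 = fit_line(xd, yd)                        # fit without case i
    e_pred = y[i] - (c0 + c1 * x[i])                 # predicted residual e_(i)
    # Squared distance between the two fits on the retained cases
    r_brute = sum((c0 + c1 * xj - b0 - b1 * xj) ** 2 for xj in xd)
    s2_del = sum((yj - c0 - c1 * xj) ** 2 for xj, yj in zip(xd, yd)) / (n - p - 1)
    t2 = e_pred ** 2 * (1 - h) / s2_del              # squared studentized residual
    return {
        "e_pred": (e_pred, e / (1 - h)),
        "R": (r_brute, (1 - h) * h * e_pred ** 2),
        "D": (r_brute / (p * s2_del), t2 * h / p),
    }


x = [10, 11, -11, 10]
y = [10, 11, -11, 10.5]
checks = [deletion_quantities(x, y, i) for i in range(3)]   # observations 1 to 3
all_match = all(math.isclose(a, b, rel_tol=1e-6)
                for c in checks for a, b in c.values())
print(all_match, [round(c["D"][0], 4) for c in checks])
```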

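For given $\alpha$, let $C_i$ satisfy the following equation:

$$P\{D_{(i)} > C_i\} = \alpha.$$

Since $D_{(i)} = t^2 h_{ii} \cdot \frac{1}{p}$, we have

$$P\{D_{(i)} > C_i\} = P\left\{t^2 h_{ii} \cdot \frac{1}{p} > C_i\right\} = P\left\{t^2 > \frac{p\,C_i}{h_{ii}}\right\} = \alpha.$$

Hence

$$\frac{p\,C_i}{h_{ii}} = F(1, n - p - 1; 1 - \alpha), \qquad C_i = \frac{1}{p}h_{ii}F(1, n - p - 1; 1 - \alpha),$$

where $F(1, n - p - 1; 1 - \alpha)$ denotes the $(1 - \alpha)$ quantile of the $F$ distribution with 1 and $n - p - 1$ degrees of freedom.

From equation (5), with $\mathbf{e}_{(i)} = Y_{(i)} - X_{(i)}\hat\beta$ and $R_{(i)} = p\, s_{(i)}^2 D_{(i)}$,

$$\frac{\mathbf{e}_{(i)}'\mathbf{e}_{(i)}}{SSE_{(i)}} = 1 + \frac{R_{(i)}}{SSE_{(i)}} = 1 + \frac{p}{n - p - 1}D_{(i)}.$$

Hence, when $D_{(i)} > C_i$, the estimated value of this ratio satisfies

$$\frac{\mathbf{e}_{(i)}'\mathbf{e}_{(i)}}{SSE_{(i)}} > 1 + \frac{p}{n - p - 1} \cdot \frac{1}{p}h_{ii}F(1, n - p - 1; 1 - \alpha) = 1 + \frac{1}{n - p - 1}h_{ii}F(1, n - p - 1; 1 - \alpha). \tag{17}$$

Thus the $i$-th data is an influential data if, when it is deleted, $D_{(i)} > C_i$. In the example below (Table III and Table IV), we find that observation number $i = 29$ is an influential data. The amount of the influence on the residual, $R_{(i)}/s_{(i)}^2$, will then be at least $h_{ii}F(1, n - p - 1; 1 - \alpha)$.

From (17), we define

$$IF_{(i)} = \frac{D_{(i)}}{C_i}$$

as the measure of the strength of the influence on the residual when the $i$-th data is deleted.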
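As a worked instance of the rule $D_{(i)} > C_i$, the Python sketch below evaluates $D_{(i)}$, $C_i$ and $IF_{(i)}$ for observation 29 of the gasoline example. The inputs are the values reported for that observation in Table III (RSTUDENT $= 2.2937$, $h_{ii} = 0.0586$), with $p = 5$ and the critical value $F(1, 26; 0.95) = (2.056)^2$ quoted in the example section. The function name `influence_summary` is a hypothetical helper, and the critical value is passed in rather than computed.

```python
def influence_summary(rstudent, h, p, f_crit):
    """Return (D_(i), C_i, IF_(i)) from the studentized residual and leverage."""
    d = rstudent ** 2 * h / p     # D_(i) = t^2 * h_ii / p
    c = h * f_crit / p            # C_i = h_ii * F(1, n-p-1; 1-alpha) / p
    return d, c, d / c            # IF_(i) = D_(i) / C_i


# Observation 29: values as reported in Table III; F(1, 26; 0.95) = 2.056**2
d29, c29, if29 = influence_summary(2.2937, 0.0586, 5, 2.056 ** 2)
print(round(d29, 3), round(c29, 3), round(if29, 3))
```

Since $IF_{(i)} = t^2/F(1, n - p - 1; 1 - \alpha)$, an observation is flagged exactly when its squared studentized residual exceeds the critical value; the leverage cancels in the ratio.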


We compare $D_{(i)}$ with Cook's $D$ in Table I and Table II for the Welsch (1982) example. We find that $D_{(4)}$ reflects very strongly the influence on the fitted line: if we delete the 4th data, the line fits perfectly. Hence $D_{(i)}$ is sensitive to the bias.

TABLE I

Obs.   Residual   RSTUDENT   HAT DIAG H
1      -0.1615    -0.5432    0.3231
2      -0.1689    -0.5948    0.3553
3      -0.0080    -0.5837    0.9985
4       0.3385     ∞         0.3231

TABLE II

Obs.   D(i)      Ci         IF(i)
1      0.0476    26.0810    0.0018
2      0.0629    28.6802    0.0022
3      0.1701    80.6001    0.0021
4      ∞         26.0810    ∞

3.  Selecting  Important  Independent  Variables 

We shall consider the influential data in the reduced model. For the selection of important independent variables in (1), it is necessary to consider the measurement of the influence.

We denote model (1) as $Y' = [Y_1, \ldots, Y_n]$, $X = [1, X_1, \ldots, X_{p-1}]$, $\beta' = [\beta_0, \beta_1, \ldots, \beta_{p-1}]$ and $\varepsilon \sim N(0, \sigma_0^2 I_n)$, where $I_n$ denotes the identity matrix of order $n \times n$. The model (1) having $p - 1$ independent variables is considered as the true model. Any reduced model whose 'X matrix' has $r$ columns is obtained by retaining any $r - 1$ of the $p - 1$ independent variables $X_1, \ldots, X_{p-1}$, where $2 \le r \le p - 1$. For each $r$, $2 \le r \le p - 1$, there are $k_r = \binom{p-1}{r-1}$ such models. These $k_r$ reduced models of 'size' $r$ are indexed arbitrarily with the indexing variable $\ell$ going from 1 to $k_r$. We will refer to a typical model as Model $M_{r\ell}$. If the $i$-th data is deleted, then the reduced Model $M_{r\ell}$ is denoted by $M_{r\ell(i)}$. A reduced model of size $r$ can be written as

$$E(Y) = X_{r\ell}\beta_{r\ell}, \quad \ell = 1, 2, \ldots, k_r. \tag{18}$$

The reduced model with the $i$-th data deleted is

$$E(Y_{(i)}) = X_{r\ell(i)}\beta_{r\ell(i)}, \quad \ell = 1, 2, \ldots, k_r. \tag{19}$$

It should be pointed out that all expectations and probabilities are calculated under the model (1).
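The indexing of reduced models described above can be made concrete with a short Python sketch. It enumerates, for a given size $r$, every choice of $r - 1$ retained variables out of $X_1, \ldots, X_{p-1}$ and checks the count against $k_r = \binom{p-1}{r-1}$. The function name `reduced_models` and the use of variable indices 1 to $p - 1$ are illustrative assumptions; the ordering produced by `itertools.combinations` is just one of the arbitrary indexings the text allows.

```python
from itertools import combinations
from math import comb


def reduced_models(p, r):
    """All index sets of the r - 1 variables retained out of X_1, ..., X_{p-1}."""
    return list(combinations(range(1, p), r - 1))


p = 5  # intercept plus four independent variables
counts = {r: len(reduced_models(p, r)) for r in range(2, p)}
print(counts)
print(reduced_models(p, 3))
```

With $p = 5$ and $r = 3$, the pair `(3, 4)` corresponds to a reduced model that retains only $X_3$ and $X_4$ alongside the intercept.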

Usually, we use the residual sum of squares to measure the goodness of the fitted model for a random sample. Hence, the expected residual sum of squares is naturally considered as the measurement for the goodness of fit. Large values of this expectation are not desirable. However, the estimate of the expectation is heavily influenced by influential data, so it is important to detect them and consider them seriously. It should first be noted that our comparisons of models are made under the true model assumptions.

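For any $r$, $2 \le r \le p - 1$, the residual sums of squares $SS_{r\ell}$ and $SS_{r\ell(i)}$ for the reduced models $M_{r\ell}$ and $M_{r\ell(i)}$, $1 \le \ell \le k_r$, $i = 1, 2, \ldots, n$, are respectively as follows:

$$SS_{r\ell} = Y'Q_{r\ell}Y \quad \text{and} \quad SS_{r\ell(i)} = Y_{(i)}'Q_{r\ell(i)}Y_{(i)}, \tag{20}$$

where

$$Q_{r\ell} = I_n - X_{r\ell}(X_{r\ell}'X_{r\ell})^{-1}X_{r\ell}'$$

and

$$Q_{r\ell(i)} = I_{n-1} - X_{r\ell(i)}(X_{r\ell(i)}'X_{r\ell(i)})^{-1}X_{r\ell(i)}'.$$

Also

$$\frac{SS_{r\ell}}{\sigma_0^2} \sim \chi'^2\{n - r, \lambda_{r\ell}\}$$

and

$$\frac{SS_{r\ell(i)}}{\sigma_0^2} \sim \chi'^2\{n - r - 1, \lambda_{r\ell(i)}\}, \tag{21}$$

where

$$\lambda_{r\ell} = \frac{(X\beta)'Q_{r\ell}(X\beta)}{2\sigma_0^2}$$

and

$$\lambda_{r\ell(i)} = \frac{(X_{(i)}\beta)'Q_{r\ell(i)}(X_{(i)}\beta)}{2\sigma_0^2}.$$

We note that $Q_{r\ell}$ and $Q_{r\ell(i)}$ are idempotent and symmetric; thus they are positive semi-definite. Hence $\lambda_{r\ell}$ and $\lambda_{r\ell(i)}$ are nonnegative, but not zero, in general.

We have

$$E[SS_{r\ell}] = (n - r)\sigma_0^2 + 2\sigma_0^2\lambda_{r\ell}$$

and

$$E[SS_{r\ell(i)}] = (n - r - 1)\sigma_0^2 + 2\sigma_0^2\lambda_{r\ell(i)}. \tag{22}$$

Since $\sigma_0^2$ is fixed, it is clear from (22) that $\lambda_{r\ell}$ and $\lambda_{r\ell(i)}$, for all $i$, should not be large for $M_{r\ell}$ to be a good model.

Gupta and Huang (1988) have proposed some selection procedures for selecting good models based on the $\lambda_{r\ell}$'s. Now we are interested in how large the influence on $\lambda_{r\ell(i)}$ is when the $i$-th data is deleted.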

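We have the unbiased estimates of $\lambda_{r\ell}$, $\sigma_0^2$, $\sigma_{0(i)}^2$ and $\lambda_{r\ell(i)}$ as follows:

$$\hat\sigma_0^2 = \frac{SS_{p1}}{n - p} = \frac{SSE}{n - p}, \qquad \hat\sigma_{0(i)}^2 = \frac{SS_{p1(i)}}{n - p - 1} = \frac{SSE_{(i)}}{n - p - 1},$$

$$\hat\lambda_{r\ell} = \frac{n - p}{2} \cdot \frac{SS_{r\ell}}{SS_{p1}} - \frac{n - r}{2}$$

and

$$\hat\lambda_{r\ell(i)} = \frac{n - p - 1}{2} \cdot \frac{SS_{r\ell(i)}}{SS_{p1(i)}} - \frac{n - r - 1}{2}, \tag{23}$$

where $SS_{p1} = SSE$ and $SS_{p1(i)} = SSE_{(i)}$ are the residual sums of squares of the full model (1) with and without the $i$-th data.

Since

$$(X_{(i)}\beta - X_{r\ell(i)}\beta_{r\ell(i)})'Q_{r\ell(i)}(X_{(i)}\beta - X_{r\ell(i)}\beta_{r\ell(i)}) = 2\sigma_0^2\lambda_{r\ell(i)},$$

$\lambda_{r\ell(i)}$ also measures the influence of the $i$-th data on fitted values. We define the measurement of the influence for the $i$-th data as follows:

$$D_{r\ell(i)} = \lambda_{r\ell(i)} = \frac{(X_{(i)}\beta)'Q_{r\ell(i)}(X_{(i)}\beta)}{2\sigma_0^2}. \tag{24}$$

We estimate $D_{r\ell(i)}$ in (24) by the following statistic to measure the influence:

$$\hat{D}_{r\ell(i)} = \hat\lambda_{r\ell(i)} = \frac{n - p - 1}{2} \cdot \frac{SS_{r\ell(i)}}{SS_{p1(i)}} - \frac{n - r - 1}{2}. \tag{25}$$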
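The estimates in (23) and (25) are simple functions of two residual sums of squares, as the Python sketch below illustrates. The function names `lambda_hat` and `lambda_hat_deleted` are hypothetical. The numerical inputs ($n = 32$, $p = 5$, $r = 3$, $SS_{31(29)} = 150.4016$, $SS_{p1(29)} = 111.657$) are the ones used in the worked gasoline example later in the text.

```python
def lambda_hat(ss_reduced, ss_full, n, p, r):
    """Estimate of lambda_{r l} in (23) from the full-data sums of squares."""
    return (n - p) / 2 * ss_reduced / ss_full - (n - r) / 2


def lambda_hat_deleted(ss_reduced_del, ss_full_del, n, p, r):
    """Estimate (25): the same form with one case deleted (n replaced by n - 1)."""
    return (n - p - 1) / 2 * ss_reduced_del / ss_full_del - (n - r - 1) / 2


# Reduced model M_31 with observation 29 deleted, values as quoted in the example
d_hat_31_29 = lambda_hat_deleted(150.4016, 111.657, n=32, p=5, r=3)
print(round(d_hat_31_29, 2))
```

When the reduced and full sums of squares coincide, both functions return $(r - p)/2$, so the estimate can be negative even though $\lambda_{r\ell}$ itself is nonnegative.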

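We can compute a constant $d$ to satisfy the following equation:

$$\inf_{\lambda_{r\ell(i)} \ge \Delta} P\{\hat{D}_{r\ell(i)} > d\} = 1 - \alpha, \tag{26}$$

where $\Delta > 0$ and $\alpha$ are given.

In (26), we have

$$P\{\hat{D}_{r\ell(i)} > d \mid \lambda_{r\ell(i)} = \Delta\} = 1 - \alpha. \tag{27}$$

Since

$$V_{r\ell(i)} = \frac{[SS_{r\ell(i)} - SS_{p1(i)}]/(p - r)}{SS_{p1(i)}/(n - p - 1)}$$

follows the noncentral $F$ distribution denoted as $F'(p - r, n - p - 1; \lambda_{r\ell(i)})$ (cf. Graybill (1976)), we have

$$\left[\left(d + \frac{n - r - 1}{2}\right)\frac{2}{n - p - 1} - 1\right]\frac{n - p - 1}{p - r} = F'(p - r, n - p - 1; \Delta). \tag{28}$$

From (28), we have

$$d = \frac{n - p - 1}{2}\left\{1 + \frac{p - r}{n - p - 1}F'(p - r, n - p - 1; \Delta)\right\} - \frac{n - r - 1}{2}. \tag{29}$$

Patnaik (1949) provided an approximation to the noncentral $F$ distribution (cf. Guenther (1979)) by the relation

$$F'(p - r, n - p - 1; \Delta) \approx \{[(p - r) + 2\Delta]/(p - r)\}F(p^*, n - p - 1), \quad \text{where } p^* = [(p - r) + 2\Delta]^2/[(p - r) + 4\Delta]. \tag{30}$$

Hence the constant $d$ can be computed as follows:

$$d = \frac{n - p - 1}{2}\left\{1 + \frac{(p - r) + 2\Delta}{n - p - 1}F(p^*, n - p - 1)\right\} - \frac{n - r - 1}{2}. \tag{31}$$

We summarize the results as follows:

If the $i$-th data is deleted, and

$$\hat{D}_{r\ell(i)} > d, \tag{32}$$

then there exists an influential data in the reduced model $M_{r\ell}$.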
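The cutoff $d$ based on Patnaik's approximation can be sketched in Python as follows. The function names `patnaik_df` and `threshold_d` are illustrative. The central $F$ critical value is supplied by the caller rather than computed; here it is the value $F(2.94, 26; 0.95) \approx 3.00$ quoted in the worked example later in the text, together with $n = 32$, $p = 5$, $r = 3$ and $\Delta = 1.3$.

```python
def patnaik_df(p, r, delta):
    """Adjusted numerator degrees of freedom p* in (30)."""
    nu1 = p - r
    return (nu1 + 2 * delta) ** 2 / (nu1 + 4 * delta)


def threshold_d(n, p, r, delta, f_crit):
    """Cutoff d of (31); f_crit is the central F value F(p*, n - p - 1)."""
    nu1, nu2 = p - r, n - p - 1
    return nu2 / 2 * (1 + (nu1 + 2 * delta) / nu2 * f_crit) - (n - r - 1) / 2


p_star = patnaik_df(p=5, r=3, delta=1.3)
d = threshold_d(n=32, p=5, r=3, delta=1.3, f_crit=3.00)
print(round(p_star, 2), round(d, 1))
```

Algebraically, (31) reduces to $d = \{[(p - r) + 2\Delta]F(p^*, n - p - 1) - (p - r)\}/2$, which makes it clear that the cutoff depends on $n$ only through the critical value.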

A reduced model $M_{r\ell}$ is called an inferior model if there is some $i$-th data which satisfies the condition (32), where the $i$-th data is not an influential data in model (1). A method to select important independent regression variables is given in Gupta and Huang (1988).

To summarize, the selection process is as follows:

Using the Gupta and Huang (1988) procedure, we select some desirable reduced models, denoted by $T$, at stage 1. If any model in $T$ is an inferior model, then we reject it. The set of the remaining models is denoted by $T'$. At stage 2, from the set $T'$, we select the reduced model associated with the smallest total error using the statistic $\hat\Gamma_{r\ell}$, where $\hat\Gamma_{r\ell} = 2[\hat\lambda_{r\ell} + (p - r)] - (2p - 3r)$.
We shall take an example for the selection of influential data from Daniel and Wood (1980, p. 234). The data were obtained in a laboratory study of the distillation properties of various crude oils with respect to their yield of gasoline. The four independent variables and the response measured were:

$X_1$: crude oil gravity, °API,

$X_2$: crude oil vapor pressure, psi,

$X_3$: crude oil ASTM 10% point, °F,

$X_4$: gasoline ASTM end point, °F,

$Y$: gasoline yield, as percentage of crude.

We fit the full model for the data as follows:

$$\hat{Y} = -6.952 + 0.229X_1 + 0.553X_2 - 0.149X_3 + 0.155X_4,$$

$R^2 = 0.96$, Root MSE $= 2.231$, C.V. $= (\text{Root MSE}/\bar{Y}) \times 100\% = 11.34\%$,

where $\bar{Y}$ is the sample mean of the $Y_i$'s. We consider the reduced model as in Daniel and Wood (1980, p. 247):

$$\hat{Y} = 70.84 - 0.212X_3 + 0.159(X_4 - 332), \tag{33}$$

$R^2 = 0.95$, Root MSE $= 2.426$, C.V. $= 12.338\%$.

In the reduced model (33), there is no influential data.

We have computed some values in Table III and Table IV to illustrate the various statistics in the previous discussion.

The notation used in the tables is as follows:

Residual $= Y_i - \hat{Y}_i$,

RSTUDENT $= e_i / [s_{(i)}(1 - h_{ii})^{1/2}]$,

HAT DIAG H $= h_{ii}$,

$D_{(i)}$ = (RSTUDENT)$^2$ × (HAT DIAG H)$/p$,

$C_i = \frac{1}{p}h_{ii}F(1, 32 - 5 - 1; 0.95)$,

$IF_{(i)} = D_{(i)}/C_i$,

where $F(1, 26; 0.95) = (2.056)^2$, $n = 32$ and $p = 5$.
For the reduced model $M_{31}$ in (33), and from (25), we have

$$\hat{D}_{31(29)} = \frac{32 - 5 - 1}{2} \times \frac{150.4016}{111.657} - \frac{32 - 3 - 1}{2} = 3.51,$$

and from (31), letting $\Delta = 1.3$, so that $p^* = [(5 - 3) + 2\Delta]^2/[(5 - 3) + 4\Delta] = 2.94$, and $\alpha = 0.05$, with

$$F(2.94, 26; 0.95) \approx 3.00,$$

we have $d \approx 5.9$. Thus $\hat{D}_{31(29)} < d$. We have checked that $\hat{D}_{31(i)} < d$ for all $i = 1, \ldots, 32$. Hence, it is reasonable to accept the reduced model (33) as a good model (not an inferior model). Note that the value of $\Delta$ can be chosen as in Gupta and Huang (1988). The value of $\Delta$ is the amount of bias for a reduced model in (26).
TABLE III

Obs.   Residual   RSTUDENT   HAT DIAG H
1      -1.8281    -0.8754    0.1340
2      -3.5300    -1.6607    0.0361
3       1.4166     0.6806    0.1494
4      -1.2104    -0.5915    0.1814
5       1.2999     0.6801    0.2829
6       2.8220     1.4356    0.1956
7      -0.6043    -0.2829    0.1170
8      -0.5769    -0.3001    0.2851
9       0.6229     0.3067    0.2014
10     -3.5804    -1.7239    0.0730
11     -0.4090    -0.1844    0.0495
12      0.3604     0.1680    0.1114
13      2.8245     1.4179    0.1755
14      0.2360     0.1117    0.1390
15     -0.3763    -0.1734    0.0901
16     -1.3024    -0.6501    0.2134
17      1.0405     0.5009    0.1598
18     -2.8326    -1.3429    0.0823
19      3.3608     1.6105    0.0763
20      2.9376     1.3887    0.0729
21     -1.7290    -0.8222    0.1248
22     -1.4534    -0.7120    0.1808
23     -2.1059    -1.0936    0.2519
24     -2.8999    -1.3971    0.1067
25      1.9063     0.9332    0.1682
26      0.2721     0.1356    0.2232
27      1.4451     0.7112    0.1881
28     -2.1919    -1.0538    0.1299
29      4.6214     2.2937    0.0586
30     -0.0500    -0.0261    0.2897
31     -0.1696    -0.0809    0.1521
32      1.6838     0.8975    0.3000
TABLE IV

Obs.   D(i)     Ci       SSE(i)   IF(i)
1      ...      0.113    ...      0.181
2      ...      0.031    121.5    ...
3      ...      0.126    132.0    0.109
4      0.013    0.153    132.6    0.083
5      0.026    0.239    132.0    0.109
6      0.081    0.165    124.5    0.488
7      0.002    0.099    134.0    0.019
8      0.005    0.241    133.9    0.021
9      0.004    0.170    133.9    0.022
10     0.04     0.062    120.7    0.703
11     0.000    0.042    134.2    0.008
12     0.001    0.094    134.3    0.007
13     0.071    0.148    124.7    0.476
14     0.000    0.118    124.7    0.003
15     0.001    0.076    134.2    0.007
16     0.018    0.180    132.2    0.100
17     0.008    0.135    133.1    0.059
18     0.030    0.070    125.7    0.427
19     0.040    0.065    122.2    0.614
20     0.028    0.062    125.5    0.456
21     0.017    0.106    131.0    0.160
22     0.018    0.153    131.8    0.120
23     0.060    0.213    128.5    0.283
24     0.042    0.090    124.9    0.462
25     0.029    0.142    130.1    0.206
26     0.001    0.189    134.3    0.004
27     0.019    0.159    131.8    0.120
28     0.029    0.110    128.9    0.263
*29    0.062    0.050    111.7    1.245
30     0.000    0.245    134.4    0.000
31     0.000    0.129    134.4    0.002
32     0.048    0.254    130.4    0.191

Remark: * denotes the influential data.

$$\hat\beta_{(29)} = \begin{pmatrix} -8.05 \\ 0.25 \\ 0.52 \\ -0.15 \\ 0.15 \end{pmatrix}, \qquad \hat\beta = \begin{pmatrix} -6.95 \\ 0.23 \\ 0.55 \\ -0.15 \\ 0.15 \end{pmatrix}.$$

The effect on $\hat\beta$ of deleting observation number 29 is shown in the above two values, $\hat\beta_{(29)}$ and $\hat\beta$. The big change takes place in $\hat\beta_0$.